Benchmarking Scalable Predictive Uncertainty in Text Classification

نویسندگان

چکیده

This paper explores the question of how predictive uncertainty methods perform in practice Natural Language Processing, specifically multi-class and multi-label text classification. We conduct benchmarking experiments with 1-D convolutional neural networks pre-trained transformers on six real-world classification datasets which we empirically investigate why popular scalable estimation strategies ( Monte-Carlo Dropout , xmlns:xlink="http://www.w3.org/1999/xlink">Deep Ensemble ) notable extensions xmlns:xlink="http://www.w3.org/1999/xlink">Heteroscedastic xmlns:xlink="http://www.w3.org/1999/xlink">Concrete underestimate uncertainty. motivate that benefits from combining posterior approximation procedures, linking it to recent research ensembles variational Bayesian navigate loss landscape. find our proposed method combination by analysis in- domain calibration, cross-domain classification, novel class robustness, demonstrates superior performance, even at a smaller ensemble size. Our results corroborate importance fine-tuning dropout rate task hand, individually as an impacts model robustness. observe ablation severely underperform novelty detection, limiting applicability transfer learning when distribution shift classes can be expected.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Personalized News Categorization Through Scalable Text Classification

Existing news portals on the WWW aim to provide users with numerous articles that are categorized into specific topics. Such a categorization procedure improves presentation of the information to the end-user. We further improve usability of these systems by presenting the architecture of a personalized news classification system that exploits user’s awareness of a topic in order to classify th...

متن کامل

Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles

Deep neural networks (NNs) are powerful black box predictors that have recently achieved impressive performance on a wide spectrum of tasks. Quantifying predictive uncertainty in NNs is a challenging and yet unsolved problem. Bayesian NNs, which learn a distribution over weights, are currently the state-of-the-art for estimating predictive uncertainty; however these require significant modifica...

متن کامل

Ensembles of Sparse Multinomial Classifiers for Scalable Text Classification

Machine learning techniques face new challenges in scalability to large-scale tasks. Many of the existing algorithms are unable to scale to potentially millions of features and structured classes encountered in web-scale datasets such as Wikipedia. The third Large Scale Hierarchical Text Classification evaluation (LSHTC3) evaluated systems for multi-label hierarchical categorization of Wikipedi...

متن کامل

Scalable text classification as a tool for personalization

We consider scalability issues of the text classification problem where by using (multi)-labeled training documents, we try to build classifiers that assign documents into classes permitting classification in multiple classes. A new class of classification problems; called ‘scalable’, is introduced, with applications on web mining. Scalable classification utilizes newly classified instances in ...

متن کامل

Benchmarking short text semantic similarity

Short Text Semantic Similarity measurement is a new and rapidly growing field of research. “Short texts” are typically sentence length but are not required to be grammatically correct. There is great potential for applying these measures in fields such as Information Retrieval, Dialogue Management and Question Answering. A dataset of 65 sentence pairs, with similarity ratings, produced in 2006 ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2022

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2022.3168734